CHAPTER 9 Summarizing and Graphing Your Data 115

How can you convey a visual picture of what the true distribution may look like by

using just a few summary numbers? By reporting values of measures of some

important characteristics of these distributions, so that the reader can infer the

shape. This is similar to learning that one Olympic ice skater scored an average of

9.0 compared to another who scored an average of 5.0. You will not know what the

skate routines looked like unless you watch them, but the score will already tell

you that if you were to watch them, you would expect to see that the one that

scored 9.0 was executed in a more visually pleasing way than the one that

scored 5.0.

Frequency distributions have names for their important characteristics, including:»

» Center: Where along the distribution of the values do the numbers tend

to center?»

» Dispersion: How much do these numbers spread out?»

» Symmetry: If you were to draw a vertical line down the middle of the

distribution, does the distribution shape appear as if the vertical line is a

mirror, reflecting an identical shape on both sides? Or do the sides look

noticeably different — and if so, how?»

» Shape: Is the top of the distribution nicely rounded, or pointier, or flatter?

Like using average skating scores to describe the visual appeal of an Olympic skate

routine, to describe a distribution you need to calculate and report numbers that

measure each of these four characteristics. These characteristics are what we

mean by summary statistics for numerical variables.

Locating the center of your data

When you start exploring a set of numbers, an important first step is to determine

what value they tend to center around. This characteristic is called, intuitively

enough, central tendency. Many statistical textbooks describe three measures of

central tendency: mean (which is the same as average), median, and mode. You may

assume these are the three optimal measures to describe a distribution (because

they all begin with m and are easy to remember). But all three have limitations,

especially when dealing with data obtained from samples in human research, as

described in the following sections.

Arithmetic mean

The arithmetic mean, also commonly called the mean (or the average), is the most

familiar and most often quoted measure of central tendency. Throughout this

book, whenever we use the two-word term the mean, we’re referring to the